Sub-band weighted projection measure for robust sub-band speech recognition

نویسندگان

Babak Nasersharif

Ahmad Akbari

چکیده

In recent years, sub-band speech recognition has been found useful in robust speech recognition, especially for speech signals contaminated by band-limited noise. In sub-band speech recognition, full band speech is divided into several frequency sub-bands and then sub-band feature vectors or their generated likelihoods by corresponding sub-band recognizers are combined to give the result of recognition task. In this paper, we concatenate sub-band feature vectors, where we extract phase autocorrelation (PAC) MFCC, as noise robust features, from each sub-band. Furthermore, we extend a model adaptation method, named sub-band weighted projection measure (SWPM), to adapt HMM Gaussian mean vectors to concatenated sub-band feature vectors in noisy conditions. The experimental results indicate that the proposed method significantly improves the sub-band speech recognition system performance in presence of additive noise.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

IMPROVED HMM ENTROPY FOR ROBUST SUB−BAND SPEECH RECOGNITION (ThuPmOR1)

In recent years, sub−band speech recognition has been found useful in robust speech recognition, especially for speech signals contaminated by band−limited noise. In sub−band speech recognition, full band speech is divided into several frequency sub−bands and then sub−band feature vectors or their generated likelihoods by corresponding sub−band recognizers are combined to give the result of rec...

متن کامل

Multi-Channel Sub-Band Speech Recognition

Two distinct fields of research into robust speech recognition are the use of microphone arrays for signal enhancement and the use of independent frequency sub-band models for robust recognition. In this article, we propose and investigate the integration of these two techniques on two different levels. First, a broad-band beamforming microphone array allows for natural integration with sub-ban...

متن کامل

Maximum likelihood sub-band weighting for robust speech recognition

Sub-band speech recognition approaches have been proposed for robust speech recognition, where full-band power spectra are divided into several sub-bands and then likelihoods or cepstral vectors of the sub-bands are merged depending on their reliability. In conventional sub-band approaches, correlations across the sub-bands are not modeled and the merging weights can only be set experientially ...

متن کامل

Maximum likelihood sub-band adaptation for robust speech recognition

Noise-robust speech recognition has become an important area of research in recent years. In current speech recognition systems, the Mel-frequency cepstrum coefficients (MFCCs) are used as recognition features. When the speech signal is corrupted by narrow-band noise, the entire MFCC feature vector gets corrupted and it is not possible to exploit the frequency-selective property of the noise si...

متن کامل

Multi-resolution cepstral features for phoneme recognition across speech sub-bands

Multi-resolution sub-band cepstral features strive to exploit discriminative cues in localised regions of the spectral domain by supplementing the full bandwith cepstral features with subband cepstral features derived from several levels of sub-band decomposition. Mult-iresolution feature vectors, formed by concatenation of the subband cepstral features into an extended feature vector, are show...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Sub-band weighted projection measure for robust sub-band speech recognition

نویسندگان

چکیده

منابع مشابه

IMPROVED HMM ENTROPY FOR ROBUST SUB−BAND SPEECH RECOGNITION (ThuPmOR1)

Multi-Channel Sub-Band Speech Recognition

Maximum likelihood sub-band weighting for robust speech recognition

Maximum likelihood sub-band adaptation for robust speech recognition

Multi-resolution cepstral features for phoneme recognition across speech sub-bands

عنوان ژورنال:

اشتراک گذاری